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ABSTRACT 

Recently there has been a lot of attention focussed on a virialized halo-based approach to understanding 
the properties of the matter and galaxy power spectrum. A key ingredient in this model is the number and 
distribution of galaxies within dark matter halos as a function of mass. This quantity has been predicted 
from semi-analytic modeling and from fits to observational data. Here we present predictions for the 
occupation number and spatial distribution of sub-halos based on a high-resolution hydrodynamical 
simulation including cooling, star-formation and feedback. 

Subject headings: cosmology: theory - large-scale structure of universe 



1. INTRODUCTION 

For hierarchical models of structure formation based on 
the inflationary cold dark matter paradigm, the clustering 
and evolution of dark matter halos is now quite well un- 
derstood and can be reliably simulated using N-body tech- 
niques. In contrast, a theory for the formation and evolu- 
tion of galaxies is one of the central unsolved problems in 
cosmology. Such a theory should at the very least specify 
the relationship between galaxies and the dark matter ha- 
los in which they reside, and it is this question which we 
wish to address. 



Severa l auth ors (Ma & Fry gOOOaj, p000b| ; Seljak pOOOa ; 
Peacock 2000; Peacock & Smith 2000) have recently de- 
veloped a new way of describing the non-linear clustering 
of dark matter and galaxies. This formalism postulates 
that all the mass in the univer se lies in halos of various 
masses (Press & Schechter 1974| ) and further that on large 
scales the halos cluster according to linear theory while on 
small scales the power is do minated by ha lo profiles (Ney- 
man, Scott & Shane 1953; Peebles 1974). The galaxies 
are assigned to halos using simple rules. While this model 
requires many ingredients to be fixed by numerical ex- 
periments (typically N-body simulations or semi-analytic 
modeling) it provides a useful structure for thinking about 
gravitational clustering which gives i nsights into several 
outsta nding problems (see e.g. Sel jak 2000a; Peacock & 
Smith |2000| Selja k, Burwell & Pen g000|; AtFio-Bara ndcla 



White 2001: Scoccimarro et al. 



& Mu cket 1996 ; Cooray, H u fc M iralda-Escude 2000 



Seljak pOOObt Sheth et al. |2000| ) 



2001; Guzik& Seljak 2000 



Here we shall be primarily interested in the 2-point func- 
tion of galaxy clustering, either in Fourier space (the power 
spectrum) or real space (the correlation function). The 2- 
point function is one of the most fundamental quantities 
in large-scale structure. It is robust, but sensitive to sev- 
eral cosmological parameters such as the Hubble constant, 
the matter density and of course the primordial power 
spectrum. The key insight of the halo model is that an 
accurate prediction of this quantity requires a knowledge 
of the occupation number of galaxies in dark matter halo s 
and their spatial distribution (see also Benson et al. 2000|) . 
With only these ingredients the model can make predic- 
tions about a wide variety of quantities of interest. In ad- 
dition, these quantities allow one to "graft" galaxies onto 



pure N-body simulations of large-scale s truct ure to make 
mock galax y catalogues (Benson et al. [2000 ; Peacock & 
Smith |2000|) . 

The occupation number distribution, (N)(M) and (N(N 
1))(M) at the 2-point level, and the spatial distribution of 
galaxies thus make an ideal point of contact between ob- 
servations and theory. They can, in principle, be predicted 
from a model of galaxy formation, and they can be mea- 
sured obs ervatio nally. 

Seljak (2000a) measured (N)(M) from semi-an alytic 



models of structure formation (Ka uffma nn et al. 1999 
bu t see also Somerville & Primack |1999 and Benson et 



al. 2000) and used this to make predictions for the cluster- 
ing of galaxies. A similar procedure has been followed by 
Wechsler et al. (2000) where simulations are us ed in place 
of analytic tec hnique s. Peacock & Smith (2000) and Scoc- 
cimarro et al. ( (200l[ ) took the opposite approach of trying 
to infer this distribution from observations as a check on 
semi-analytic modeling. The agreement between the two 
methods is at most qualitative at present. Here we present 
predictions for these functions from a cosmological hy- 
drodynamic simulation which includes star-formation and 
feedback. 

2. THE SIMULATION 

Throughout, we s hall u se a new simulation of the Os- 
trikcr & Stcinhardt (1995) concordance model, which has 
n m = 0.3, n A = 0.7, H = lOO/ikms^Mpc" 1 with 
h = 0.67, £!b = 0.04, n = 1 and cr 8 = 0.9 (correspond- 
ing to Sh — 5.02 x 10~ 5 ). This model yields a reasonable 
fit to the current suite of cosmological constraints and as 
such provides a good framework for making realistic pre- 
dictions. 

We have used th e Tre e/SPH code Gadget (Springel, 
Yoshida & White |2000| ) to run a 2 x 300 3 = 54 mil- 
lion particle simulation of this model in a periodic box 
of size 33.5 /i _1 Mpc. Equal numbers of gas and dark mat- 
ter particles were employed, so mdark = 1 x 10 8 /i^ 1 M 
and TOgas = 1-5 x 10 7 ' hr l MQ and the gravitational soft- 
ening was 6/i _1 kpc, fixed in comoving coordinates. The 
simulation was started at redshift z = 99 and evolved to 
z = 1. 

In addition to the gravitational interactions and adia- 
batic hydrodynamics, the code follows radiative cooling 
and heating processes in the presence of a UV radiation 
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field in essentially the same way as described in Katz, 
Weinberg & Hernquist (1996). We mode l the UV radi- 
ation field using a Haardt & Madau ( 1996| ) spectrum with 
reionization at z = 6. Star formation (and feedback) is 
handled using a modi fication of the "multi-phase" mode l 
of Yepes et al. ( |1997| ) and Hultman & Pharasyn ( |1999| ). 
Each SPH particle is assumed to model a co-spatial fluid 
of ambient hot gas, condensed cold clouds, and stars. Hy- 
drodynamics is only followed for the hot gas phase, but the 
cold gas and stars are subject to gravity, add inertia, and 
participate in mass and energy exchange processes with 
the ambient gas phase. The algorithm will be described in 
more detail in a forthcoming paper. 

From the simulation outputs at z = 1 and 3 we have 
constructed catalogues of halos and their sub-halos using 
the algor ithm Subfind described in detail in Springel et 
al. (200C). First, the Friends-of-Friends algorithm (with a 
linking length of 0.1577," 1 / 3 ) is used to define a parent halo 
catalogue, and then bound sub-halos within each parent 
are identified. These sub-halos typically consist of cold, 
dense gas that has been able to efficiently cool and form 
stars and should be identified with galaxies in the real 
universe, at least statistically. 

We have used a linking length of 0.15 rather than the 
more canonical 0.2 in defining the parent halos, because 
we found for the larger linking length neighboring halos 
were being linked with one then identified as a sub-halo of 
the other. While this problem is not eliminated entirely 
using 0.15, it is significantly reduced. Only very close halo 
pairs or triplets, possibly in the process of merging, are 
joined with this linking length. In some instances the FOF 
algorithm finds halos which are not bound; these halos are 
not included in the analysis. 

At z — 1 there are a total of 31,737 halos and 34,529 sub- 
halos with an average star formation rate of 0.35M© yr _1 . 
At z = 3 the numbers are 37,940 and 39,579 respectively, 
with an average star formation rate of 0.25M Q yr _1 . 

3. HALO OCCUPATION NUMBER 

We show in Figs. [l], || the distribution A^ su b(M) from 
the simulation. Ideally we would identify the sub-halos 
as galaxies by some observational property, such as lu- 
minosity or color. Unfortunately we are unable to reli- 
ably compute such properties from this simulation. Even 
though we track the star-formation rate in the simulation, 
the outputs are too infrequent to graft on population syn- 
thesis codes, and any transformation we apply to the out- 
puts would simply introduce additional uncertainties and 
modeling into our predictions. Thus we have focussed on 
the number of sub-halos as a function of sub-halo (total) 
mass, sub-halo stellar mass and sub-halo "instantaneous" 
star-formation rate which we can reliably extract from the 
simulation. Additional modeling will be presented else- 
where. 

Since the FOF groups can be irregular and their mass 
difficult to interpret, we compute spherically averaged 
masses using the group particles. The center of the halo 
is taken to be the point of minimum potential, consider- 
ing the halo particles in isolation. This corresponds very 
closely to the densest particle and the most bound particle 
for all but the most disturbed systems. For the total halo 
mass we use the mass interior to a radius (^500) inside of 



which the mean density is 500 times the critical density. 
For some sub-halos the halo is truncated before this ra- 
dius in which case we use all of the bound mass in the 
halo. The stellar mass is simply the mass in stars within 
rsoo and the star formation rate is also summed over all 
star- forming particles within this radius. 

Note that for parent halos of a fixed mass there is a dis- 
tribution of sub-halo occupation numbers. Statistically 
the distribution seems to be quite close to Poisson for 
(N) 3> 1 but has less scatter than the Poisson predic- 
tion for (N) ^ 1, as is also seen in semi-analytic models. 
By definition all bound halos host at least one sub-halo. 
For all panels in Fig. |l| the distribution (N) is close to a 
power-law, with a slope of ~ 0.8 — 0.9, i.e. the number of 
sub-halos increases more slowly than the mass. At z = 3 
we again find that (N) is well fit by a power-law, with a 
slightly shallower slope than at z = 1, being closer to 0.7, 
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Fig. 1. — The number of sub-halos per FOF group as 
a function of parent halo mass in the simulation at z = 
1. (Top panel) There is 1 dot per halo. Also shown are 
(N) (long dashed) and y/(N(N - 1)) (short dashed) as a 
function of mass for: (upper lines) all sub-halos, (middle) 
sub-halos with M > 10 10 /i _1 M Q and (lower) sub-halos 
with M > 10 11 /i _1 M Q . Note that the number of sub- 
halos per parent is close to Poisson when (N) ^> 1 but sub- 
Poisson for small (N) , the mean is well approximated by a 
power-law and it increases more slowly than M. The ray 
of solid lines have slopes from 0.5 to 1 for comparison. The 
lower solid line is a prediction from semi-analytic modeling 
(see text). (Middle panel) As above but with cuts on stellar 
mass: (upper lines) all sub-halos, (middle) sub-halos with 
M stal > 10 9 /i _1 M Q and (lower) sub-halos with M star > 
10 10 /i _1 Mg. (Bottom panel) As above but with cuts on 
star-formation rate: (upper lines) all sub-halos, (middle) 
sub-halos with SFR > lAf Q /yr, (lower) sub-halos with 
SFR > 10M Q /yr. 
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again increasing more slowly than the mass. 

One could imagine that the decrease in (N)/M may 
arise because the larger halos have longer gas cooling 
times, because mergers are more efficient or because a 
larger fraction of the total mass lies "outside" of the sub- 
halos. In our simulation we find that the both the mean 
sub-halo mass (as a fraction of the parent halo mass) and 
the fraction of the halo mass in sub-halos decline with in- 
creasing parent halo mass. At the low mass end the mean 
sub-halo mass and the fraction of the mass in sub-halos, 
using M500, are near 30%. For the higher mass halos this 
declines to < 1% and 10% respectively. This suggests that 
all three effects are operating to some degree. 

The predi ctions of the semi-analytic models of K auff- 
mann et al. ( |1999|) , as fit by Sheth & Diaferio (|2000| ), are 
reproduced in Figs. [I], || as the lower solid line extending 
off the right of the figure. We can see in Fig. [I] that the 
semi-analytic models curve roughly agrees with our numer- 
ical results for sub-halos more massive than lO n M0, but 
has fewer "galaxies" than we have sub-halos if we include 
less massive objects. This could partly be due to limited 
mass resolution in the simulations onto which are grafted 
the semi-analytic recipes, but is also because the semi- 
analytic formalism has explicitly inefficient star-formation 
in low mass halos. 

4. SUB-HALO SPATIAL DISTRIBUTION 

Another ingredient in the halo model is that all parent 
halos have NFW (Navarro, Frenk k, White |1996[ ) profiles^, 
and that the satellite galaxy distribution follows the dark 
matter distribution. This obviously implies that different 
galaxy types trace each other within a given halo, which 
has not been shown. It remains unclear whether any type 

1 Or revised profiles which rise more steeply than r~ 1 as r —* 0. 



of galaxy follows the mass distribution. 

To investigate this question we have calculated the mean 
number of sub-halos in radial bins, scaled to rsoo of the 
parent halo. Fig. || shows that there is an enhanced prob- 
ability for sub-halos to reside at the center of the parent 
halo (i.e. one galaxy always resides at the center of the 
dark matter halo) and then the probability is rather flat 
with radial distance from the center. Because FOF with 
a linking length of 0.15 can occasionally link neighboring 
structures, a few sub-halos are found beyond the virial 
radius. 

For comparison we show the cumulative mass within r 
for two NFW profiles, one with c = r2ao/fs — 5 and one 
with c = 10. The total mass is (arbitrarily) normalized to 
unity at r2oo ~ 2rsoo. Apart from the central enhancement 
the sub-halos do approximately seem to follow the NFW 
profile, although they seem to be slightly more centrally 
concentrated than NFW would predict. 

Very few of our halos have enough sub-halos for a robust 
determination of the velocity distribution of the sub-halos. 
However since the dark matter and the sub-halos live in the 
same potential and have similar spatial distributions, we 
expect that any degree of "velocity bias" would be small. 

5. CONCLUSIONS 

Many of the features of the power spectrum of den- 
sity fluctuations in the universe can be simply understood 
in a model based on virialized halos. Important ingre- 
dients in the model are that the halos be biased tracers 
of the linear power spectrum and have a uniform profile 
with a correlation between the internal structure and the 
mass which should span a wide range sampling a Press- 
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Fig. 2.— As for Fig. [j]but at z = 3. 
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Fig. 3. — The number of sub-halos per FOF halo in the 
simulation at z = 1.0 as a function of radius from the 
center of the parent halo. All radii are scaled to the rsoo 
of the parent halo. We have binned the parent halos by 
log 10 (M/ft _1 Mo): (open squares): 10 — 10.5, (solid trian- 
gles) 10.5 - 11, (open circles) 11 - 11.5, (stars) 11.5 - 12. 
The dashed lines are cumulative mass within r for two 
NFW profiles with c = 5 and c = 10, normalized to unity 
at r 2 oo- 
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Schechtcr ( 1974 ) like mass function, 
it 



In fti-Hnr to understand the dustering of galaxies within 



This Quantity can be predicted by semi-analytic models of 
galaxy formation or estimated from observations. Here 
we have presented an estimate of this function spanning 
a wide range in halo masses from hydrodynamical sim- 
ulations including cooling, star formation and feedback. 
(Previous efforts to study galaxy clustering d irectly from 
hydrodynamica l simu l ation s [e.g. Katz e t al 
Gardner et al 



1997 



1999 



199£ 



1992, 1999 



Pearce et 



Dave et al. 

al. 1999[ typically had insufficient dynamic range to esti- 
mate the galaxy occupation number over such a wide range 
in halo mass.) 

We find that the number of galaxies per halo is very 
close to a Poisson distribution, with a mean which grows 
more slowly than the mass of the parent halo. This holds 
true regardless of whether we select sub-halos based on to- 
tal mass, stellar mass or star formation rate. The spatial 
distribution of sub-halos within their parent halos appears 
to follow the distribution of the total mass relatively well, 
with the number of sub-halos per spherical shell being ap- 
proximately constant at large radius. Closer to the center 
of the halo there are more sub-halos than a halos-follow- 
mass model would predict. 
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